Reducing Decision Tree Ensemble Size Using Parallel Decision DAGs
نویسندگان
چکیده
This research presents a new learning model, the Parallel Decision DAG (PDDAG), and shows how to use it to represent an ensemble of decision trees while using significantly less storage. Ensembles such as Bagging and Boosting have a high probability of encoding redundant data structures, and PDDAGs provide a way to remove this redundancy in decision tree based ensembles. When trained by encoding an ensemble, the new model behaves similar to the original ensemble, and can be made to perform identically to it. The reduced storage requirements allow an ensemble approach to be used in cases where storage requirements would normally be exceeded, and the smaller model can potentially execute faster by reducing redundant computation.
منابع مشابه
Fault Detection in Ring Based Smart LVDC Microgrid Using Ensemble of Decision Tree
In modern infrastructure, the demand for DC power-based appliances is rapidly increasing, and this phenomenon has created a positive impact on the acceptance of the DC microgrid. However, due to numerous issues such as the absence of zero crossing, bidirectional behaviour of sources, and different magnitudes of fault current during grid connected and islanded modes of operation, protecting DC m...
متن کاملEnsemble Classification and Extended Feature Selection for Credit Card Fraud Detection
Due to the rise of technology, the possibility of fraud in different areas such as banking has been increased. Credit card fraud is a crucial problem in banking and its danger is over increasing. This paper proposes an advanced data mining method, considering both feature selection and decision cost for accuracy enhancement of credit card fraud detection. After selecting the best and most effec...
متن کاملClassification of Customer’s Credit Risk Using Ensemble learning (Case study: Sepah Bank)
Banks activities are associated with different kinds of risk such as cresit risk. Considering the limited financial resources of banks to provide facilities, assessment of the ability of repayment of bank customers before granting facilities is one of the most important challenges facing the banking system of the country. Accordingly, in this research, we tried to provide a model for determinin...
متن کاملImproving Accuracy in Intrusion Detection Systems Using Classifier Ensemble and Clustering
Recently by developing the technology, the number of network-based servicesis increasing, and sensitive information of users is shared through the Internet.Accordingly, large-scale malicious attacks on computer networks could causesevere disruption to network services so cybersecurity turns to a major concern fornetworks. An intrusion detection system (IDS) could be cons...
متن کاملA novel hybrid method for vocal fold pathology diagnosis based on russian language
In this paper, first, an initial feature vector for vocal fold pathology diagnosis is proposed. Then, for optimizing the initial feature vector, a genetic algorithm is proposed. Some experiments are carried out for evaluating and comparing the classification accuracies which are obtained by the use of the different classifiers (ensemble of decision tree, discriminant analysis and K-nearest neig...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- International Journal on Artificial Intelligence Tools
دوره 18 شماره
صفحات -
تاریخ انتشار 2009